A Morphological Processor for Malayalam Language
Abstract
Work on morphological analyzers (which are computer programmes) for Indian languages is being pursued actively these days. Usually published in specialized journals, this rather technical work is presented briefly here to give a wider readership some insight into little-known aspects of current language work. The morphological richness of Malayalam, a major South Indian language, justifies thorough morphological processing, which is the first step in any natural language processing task. The project aims to build a morphological processor for the language, with two main components: a morphological generator and a morphological analyzer. The computational model of the processor handles nouns, pronouns, verbs and modifiers. The results obtained are encouraging, and the work can be extended to a full-fledged part-of-speech tagger for Malayalam and other Dravidian languages, since they all exhibit structural homogeneity.
Similar resources
Unity in Diversity: A Unified Parsing Strategy for Major Indian Languages
This paper presents our work on applying a non-linear neural network to parsing five resource-poor Indian languages belonging to two major language families, Indo-Aryan and Dravidian. Bengali and Marathi are Indo-Aryan languages, whereas Kannada, Telugu and Malayalam belong to the Dravidian family. While little work has been done previously on Bengali and Telugu linear transition-based parsing, w...
Automated Plagiarism Detection System for Malayalam Text Documents
In this paper, a plagiarism detection tool for Malayalam documents is presented. Many language-sensitive tools for detecting plagiarism in natural language documents have been developed, particularly for English. Detecting plagiarism in Malayalam documents is a particularly challenging task because of the complex linguistic structure of Malayalam. The plagiarism detectio...
Clause Boundary Identification for Malayalam Using CRF
This paper presents a clause boundary identification system for Malayalam sentences using the machine learning approach CRF (Conditional Random Field). Malayalam is considered a "left-branching language", in which verbs appear at the end of the sentence. Clause boundary identification plays a vital role in many NLP applications and for Malayalam language, the clause boundary identifica...